Human posture feature extraction in personal emergency response systems and methods

ABSTRACT

A non-wearable Personal Emergency Response System (PERS) architecture is provided, implementing RF interferometry using synthetic aperture antenna arrays to derive ultra-wideband echo signals which are analyzed and then processed by a two-stage human state classifier and abnormal states pattern recognition. Systems and methods transmit ultra-wide band radio frequency signals at, and receive echo signals from, the environment, process the received echo signals to derive a spatial distribution of echo sources in the environment using spatial parameters of the at least one transmitting and/or receiving antennas, and estimate postures human(s) in the environment by analyzing the spatial distribution with respect to echo intensity. The antennas may be arranged in several linear baselines, implement virtual displacements, and may be set into multiple communicating sub-arrays. The decision process is carried out based on the instantaneous human state (local decision) followed by abnormal states patterns recognition (global decision).

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of U.S. patent application Ser. No. 15/008,460 filed on Jan. 28, 2016, which in turn is a continuation-in-part of and claimed priority from U.S. patent application Ser. No. 14/983,632, filed on Dec. 30, 2015, which in turn is a continuation-in-part of U.S. patent application Ser. No. 14/753,062, filed on Jun. 29, 2015, all of which are incorporated herein by reference in their entirety.

FIELD OF THE INVENTION

The present invention relates to the field of elderly monitoring using ultra-wide band interferometry, and more particularly, to human posture feature extraction in personal emergency response system (PERS).

BACKGROUND OF THE INVENTION

Elderly people have a high risk of falling, for example, in residential environments. As most of elder people will need immediate help after such a fall, it is crucial that these falls are monitored and addressed in real time. Specifically, one fifth of falling elders are admitted to hospital after staying on the floor for over one hour following a fall. The late admission increases the risk of dehydration, pressure ulcers, hypothermia and pneumonia. Acute falls lead to high psychological effects of fear and negatively impact the quality of daily life.

Most of the existing personal emergency response systems (PERS), which take the form of fall detectors and alarm buttons, are wearable devices. These wearable devices have several disadvantages. First, they cannot recognize the human body positioning and posture.

Second, they suffer from limited acceptance and use due to: elders' perception and image issues, high rate of false alarms and miss-detects, elders neglect re-wearing when getting out of bed or bath, and the fact that long term usage of wearable devices might lead to user skin irritations. Third, the wearable PERS are used mainly after experiencing a fall (very limited addressable market).

Therefore, there is a need for a paradigm shift toward automated and remote monitoring systems.

SUMMARY OF THE INVENTION

Some embodiments of the present invention provide a unique sensing system and a breakthrough for the supervision of the elderly during their stay in the house, in general, and detect falls, in particular. The system may include: a UWB-RF Interferometer, Vector Quantization based Human states classifier, Cognitive situation analysis, communication unit and processing unit.

One aspect of the present invention provides a method comprising: (i) transmitting, via at least one transmitting antenna, ultra-wide band (UWB) radio frequency (RF) signals at an environment including at least one human, and receiving, via at least one receiving antenna, echo signals from the environment, (ii) processing the received echo signals to derive a spatial distribution of echo sources in the environment using spatial parameters of the at least one transmitting and/or receiving antennas, and (iii) estimating a posture of the at least one human by analyzing the spatial distribution with respect to echo intensity.

According to some embodiments of the present invention, the system may be installed in the house's ceiling, and covers a typical elder's apartment with a single sensor, using Ultra-Wideband RF technology. It is a machine learning based solution that learns the elder's unique characteristics (e.g., stature, gait and the like) and home primary locations (e.g., bedroom, restroom, bathroom, kitchen, entry, etc.), as well as the home external walls boundaries.

According to some embodiments of the present invention, the system may automatically detect and alert emergency situation that might be encountered by elders while being at home and identify the emergency situations.

According to some embodiments of the present invention, the system may detect falls of elderly people, but may also identify other emergencies situations, such as labored breathing, sleep apnea, as well as other abnormal cases, e.g., sedentary situation, repetitive non-acute falls that are not reported by the person. It is considered as a key element for the elderly connected smart home, and, by connecting the system to the network and cloud, it can also make use of data analytics to identify new patterns of emergencies and abnormal situations.

These, additional, and/or other aspects and/or advantages of the present invention are set forth in the detailed description which follows; possibly inferable from the detailed description; and/or learnable by practice of the present invention.

BRIEF DESCRIPTION OF THE DRAWINGS

The subject matter regarded as the invention is particularly pointed out and distinctly claimed in the concluding portion of the specification. The invention, however, both as to organization and method of operation, together with objects, features, and advantages thereof, may best be understood by reference to the following detailed description when read with the accompanying drawings in which:

FIGS. 1A-1C are block diagrams illustrating a non-limiting exemplary architecture of a system in accordance with embodiments of the present invention.

FIGS. 2A and 2B are high level schematic illustrations of configurations of a linear baseline (SAAA), according to some embodiments of the invention.

FIG. 2C illustrates a non-limiting example for image resolution data achieved under the parameters defined above, for the various human posture and ranges from the system, according to some embodiments of the invention.

FIG. 2D schematically illustrates the dependency of image resolution on the orientation of the object, according to some embodiments of the invention.

FIGS. 2E-2G are high level schematic diagrams illustrating conceptual 2D Synthetic Aperture Antennas arrays with virtual displacements, according to some embodiments of the invention.

FIGS. 2H-2J are high level schematic illustrations of linear antennas arrays, according to some embodiments of the invention.

FIGS. 2K and 2L are simulation results that present the field of view of the array designs, according to some embodiments of the invention.

FIG. 2M shows simulation results that present the VSWR (Voltage Standing Wave Ratio) with and without metal beams, or walls, according to some embodiments of the invention.

FIGS. 2N and 2O schematically illustrate an antenna array with tilted baselines, according to some embodiments of the invention.

FIG. 2P is high level schematic illustrations of conceptual 2D Synthetic Aperture Antennas arrays providing unambiguous positioning, according to some embodiments of the invention.

FIGS. 2Q and 2R illustrate the coverage of the system's surroundings in the non-limiting case of four baselines, according to some embodiments of the invention.

FIG. 2S is a high level schematic illustration of the system with two home cells as a non-limiting example, according to some embodiments of the invention.

FIG. 3A is a high level schematic block diagram of the system which schematically illustrates modules related to the posture extraction, in accordance with embodiments of the present invention.

FIG. 3B is a high level schematic block diagram of the operations performed by a preprocessing unit, in accordance with embodiments of the present invention.

FIGS. 3C and 3D are illustrative examples for partially coherent images, according to some embodiments of the invention.

FIGS. 3E and 3F are illustrative examples for computed projections on the x, y and z axes of 3D images of a person standing and laying, respectively, in front of the sensor according to some embodiments of the invention.

FIG. 3G illustrates schematically seven features on a schematic curve representing an arbitrary projection.

FIG. 3H is an illustration of an exemplary spanning of the features' space by two of the features described above, according to some embodiments of the invention.

FIG. 3I which is a schematic block diagram illustrating a training module in the posture classifier, according to some embodiments of the invention.

FIG. 3J is a schematic block diagram of a classifying stage in the posture classifier, according to some embodiments of the invention.

FIG. 4A is a high-level schematic flowchart illustration of exemplary motion feature extraction in feature extractor, according to some embodiments of the invention.

FIG. 4B is a high-level schematic illustration of fast and slow signal mapping, according to some embodiments of the invention.

FIG. 4C is a high-level schematic flowchart illustration of exemplary human body target detection, according to some embodiments of the invention.

FIG. 4D is a high-level schematic flowchart illustration of an exemplary slow signal preprocessing unit, according to some embodiments of the invention.

FIG. 4E is a high-level schematic flowchart illustration of exemplary Doppler preprocessing and segmentation, according to some embodiments of the invention.

FIG. 4F is a high-level schematic flowchart illustration of an exemplary maximal Doppler frequency extraction, according to some embodiments of the invention.

FIG. 4G is an exemplary illustration of a spectrogram of motion over a single range bin in the active area, according to some embodiments of the invention.

FIG. 4H is a high-level schematic flowchart illustration of an exemplary motion energy features extractor, according to some embodiments of the invention.

FIG. 4I is a high-level schematic flowchart illustration of an exemplary range-time preprocessing and segmentation flow as part of derivation of energy signature, according to some embodiments of the invention.

FIG. 4J is a high-level schematic flowchart illustration of an exemplary over-range energy distribution analysis as part of derivation of energy signature, according to some embodiments of the invention.

FIG. 4K is a high-level schematic flowchart illustration of an exemplary over-range activity distribution analysis, according to some embodiments of the invention.

FIG. 4L is a high-level schematic flowchart illustration of an exemplary motion route energy estimation, according to some embodiments of the invention.

FIG. 4M, being a schematic matrix illustration of DTW-based motion route estimation, according to some embodiments of the invention.

FIG. 4N is a schematic illustration of the possibility to separate different types of motions based on the derived parameters, according to some embodiments of the invention.

FIG. 5A is a table illustrating an exemplary states definition in accordance with some embodiments of the present invention.

FIG. 5B is a table illustrating an exemplary states matrix in accordance with some embodiments of the present invention.

FIG. 6 is a table illustrating exemplary abnormal patterns in accordance with some embodiments of the present invention.

FIG. 7 is a diagram illustrating a cloud-based architecture of the system in accordance with embodiments of the present invention.

FIG. 8 is a floor plan diagram illustrating initial monitored person training as well as the home environment and primary locations training in accordance with embodiments of the present invention.

FIG. 9 is a diagram illustrating yet another aspect in accordance with some embodiments of the present invention.

FIG. 10 is a high level schematic flowchart of a method, according to some embodiments of the invention.

It will be appreciated that for simplicity and clarity of illustration, elements shown in the figures have not necessarily been drawn to scale. For example, the dimensions of some of the elements may be exaggerated relative to other elements for clarity. Further, where considered appropriate, reference numerals may be repeated among the figures to indicate corresponding or analogous elements.

DETAILED DESCRIPTION OF THE INVENTION

Prior to the detailed description being set forth, it may be helpful to set forth definitions of certain terms that will be used hereinafter.

The term “slow signal” as used in this application refers to the signal derived from received echo (fast) signals and is spatio-temporally characterized over multiple range bins (as spatial units) and multiple sub-frames (as temporal units).

The term “motion” as used in this application refers to the motion of the body and/or of body parts without displacement of the whole body as a bulk, such as gestures, limb motions, posture changes such as sitting down or standing up, gait (separated from the displacement), motion suddenness (e.g., possible fall or collapse) etc.

The term “movement” as used in this application refers to the displacement of a person's body as a whole, irrespective of the motion of body parts such as the limbs. In certain embodiments, the term “movement” may be used to refer only to radial displacements and radial components of displacement with respect to the antenna, whereas tangential displacement may be discarded. In certain embodiments, tangential components of the displacement may be taken into account as movements as well.

The terms “transmitting antenna” and “receiving antenna” as used in this application refer are non-limiting in the sense that the system may be configured to transmit signals via antennas denoted below as receiving antennas and receive echo signals via antennas denoted below as transmitting antennas. It is known in the art that the terms “transmitting antenna” and “receiving antenna” are interchangeable in the sense that the associated electronic circuitry may be configured to reverse their respective functions. System optimization may be carried out to determine which antennas are to be operated as transmitting antennas and which as receiving antennas. For the sake of simplicity alone, most of the following description related to transmitting antennas as single antennas and to receiving antennas as baselines (linear arrangements of antennas). It is explicitly noted that receiving antennas may be single antennas and transmitting antennas may be baselines, while maintaining the applicability and scope of the invention as described below.

In the following description, various aspects of the present invention are described. For purposes of explanation, specific configurations and details are set forth in order to provide a thorough understanding of the present invention. However, it will also be apparent to one skilled in the art that the present invention may be practiced without the specific details presented herein. Furthermore, well known features may have been omitted or simplified in order not to obscure the present invention. With specific reference to the drawings, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of the present invention only, and are presented in the cause of providing what is believed to be the most useful and readily understood description of the principles and conceptual aspects of the invention. In this regard, no attempt is made to show structural details of the invention in more detail than is necessary for a fundamental understanding of the invention, the description taken with the drawings making apparent to those skilled in the art how the several forms of the invention may be embodied in practice.

Before at least one embodiment of the invention is explained in detail, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of the components set forth in the following description or illustrated in the drawings. The invention is applicable to other embodiments that may be practiced or carried out in various ways as well as to combinations of the disclosed embodiments. Also, it is to be understood that the phraseology and terminology employed herein is for the purpose of description and should not be regarded as limiting.

Unless specifically stated otherwise, as apparent from the following discussions, it is appreciated that throughout the specification discussions utilizing terms such as “processing”, “computing”, “calculating”, “determining”, “enhancing” or the like, refer to the action and/or processes of a computer or computing system, or similar electronic computing device, that manipulates and/or transforms data represented as physical, such as electronic, quantities within the computing system's registers and/or memories into other data similarly represented as physical quantities within the computing system's memories, registers or other such information storage, transmission or display devices. Any of the disclosed modules or units may be at least partially implemented by a computer processor.

A sensing system is provided for the supervision and fall detection of the elderly during their stay in the house. The system combines an UWB-RF (ultra-wide band radio frequency) interferometer with a vector-quantization-based human states classifier implementing cognitive situation analysis. The UWB-RF interferometer may implement a synthetic aperture and the human states classifier may have two stages and employ abnormal states pattern recognition. The system may be installed in the house's ceiling, and cover the area of a typical elder's apartment (<100 sqm) with a single sensor, using ultra-wideband RF technology.

The system may use machine learning to learn the elder's unique characteristics (e.g., body features, stature, gait etc.) and the home environment, and uses a human state classifier to determine the instantaneous human state based on various extracted features such as human posture, motion, location at the environment as well as human respiration. The system may automatically detect, identify and alert concerning emergency situations (particularly falls) that might be encountered by elders while being at home and identifies the emergency situations. The system detects falls as well as identifies other emergency situations such as labor briefing, sedentary situations and other abnormal cases. The decision process may be done based on the instantaneous human state (local decision) followed by abnormal states patterns recognition (global decision). The system global decision (emergency alert) is communicated to the operator through the communication system and two-ways communication is enabled between the monitored person and the remote operator.

The system may comprise a communication sub-system to communicate with the remote operator and centralized system for multiple users' data analysis. A centralized system (cloud) may receive data from distributed PERS systems to perform further analysis and upgrading the systems with updated database (codebooks).

Advantageously, the system may be used as a key element for the elderly connected smart home and by connecting the system to the network and cloud, it can also make a use of big data analytics to identify new patterns of emergencies and abnormal situations. The system overcomes the disadvantages of existing PERS such as wearable fall detectors and alarm buttons, as well as visual surveillance, by recognizing the human body positioning and posture and provides a significant enhancement in acceptability as it overcomes (i) elders' perception and image issues, (ii) high rate of false alarms and misdetections, (iii) elders' neglect of re-wearing when getting out of bed or bath, and (iv) user skin irritations by long term usage of wearable devices. Moreover, it may be used to prevent the first experience of fall (after which the use of wearable devices is first considered) and does not involve privacy issues that visual surveillance system arise.

FIGS. 1A-1C are block diagrams illustrating a non-limiting exemplary architecture of a system 100 in accordance with some embodiments of the present invention. As illustrated in FIG. 1A, system 100 may include a radio frequency (RF) interferometer 120 configured to transmit signals via Tx antenna 101 and receive echo signals via array 110-1 to 110-N. Tx antennas 101 and Rx antennas 110 are part of an antenna array 115. It should be noted that transmit antennas and receive antennas may take different forms, and, according to a preferred embodiment, in each antenna array they may be a single transmit antenna and several receive antennas. An environmental clutter cancelation module may or may not be used to filter out static non-human related echo signals. System 100 may include a human state feature extractor 130 configured to extract from the filtered echo signals, a quantified representation of position postures, movements, motions and breathing of at least one human located within the specified area. A human state classifier may be configured to identify a most probable fit of human current state that represents an actual human instantaneous status. System 100 may include an abnormality situation pattern recognition module 140 configured to apply a pattern recognition based decision function to the identified states patterns and to determine whether an abnormal physical event has occurred to the at least one human in the specified area. A communication system 150 for communicating with a remote server and end-user equipment for alerting (not shown here). Communication system 150 may further include two-way communication system between the caregiver and the monitored person for real-time assistance.

As illustrated in FIG. 1B, system 100 comprises a system controller 105, a UWB-RF interferometry unit 220, a human state classifier 250, a cognitive situation analysis module 260 and communication unit 150, the operation of which is explained below (see FIG. 1C). UWB-RF interferometry unit 220 comprises a UWB pulse generator 221, a UWB RF transmission module 121, UWB transmitting antennas 101 that deliver a UWB RF signal 91 to an environment 80, e.g., one including at least one human 90, UWB receiver antennas 110 that receive echo signals 99 from the scene and UWB RF interferometer 120 that processes the received echo signals and provide signals for extraction of multiple features, as explained below. Tx antennas 101 and Rx antennas 110 are part of antenna array 115.

FIG. 1C is another block diagram illustrating the architecture of system 100 in further details in accordance with some embodiments of the present invention as follows. UWB-RF interferometry unit 220 transmits an ultra-wideband signal (e.g., pulse) into the monitored environment and receives back the echo signals from multiple antenna arrays to provide a better spatial resolution by using the Synthetic Antenna Aperture approach. For example, UWB-RF interferometry unit 220 may comprise transmission path pulse generator 221, UWB-RF front end 223 connected to transmitting antenna(s) 101 and receiving antennas 110-1 . . . 110-N, e.g., arranged in arrays, and configured to transmit UWB RF signals generated by generator 221 to the environment and deliver echo pulses received therefrom to a reception path pro-processing module 222, possible implementing clutter cancelation with respect to clutter originating from the environment and not from human(s) in the environment. In order to increase the received signal-to-noise (SNR), the transmitter sends multiple UWB pulses and receiver receives and integrates multiple echo signals (processing gain). The multiple received signals (one signal per each Rx Antenna) are sampled and digitally stored for further signal processing.

Environmental clutter cancelation 230 may be part of a processing unit 225 as illustrated and/or may be part of UWB-RF interferometry unit 220, e.g., clutter cancelation may be at least partially carried out by a Rx path pre-processing unit 222. The echo signals are pre-processed to reduce the environmental clutter (the unwanted reflected echo components that are arrived from the home walls, furniture, etc.). The output signal mostly contains only the echo components that reflected back from the monitored human body. Environmental clutter cancelation 230 is fed with the trained environmental parameters 232. In addition, the clutter cancelation includes a stationary environment detection (i.e., no human body at zone) to retrain the reference environmental clutter for doors or furniture movement cases.

The environmental clutter cancelation is required to remove unwanted echo components that are reflected from the apartment's static items, such as walls, doors, furniture, etc. The clutter cancelation is done by subtracting the unwanted environmental clutter from the received echo signals. The residual clutter represents the reflected echo signals from the monitored human body. According to some embodiments of the present invention, the clutter cancelation also includes stationary environment detection to detect if no person is at the environment, such as when the person is not at home, or is not at the estimated zone. Therefore, a periodic stationary clutter check is carried out, and new reference clutter fingerprint is captured when the environment is identified as stationary. The system according to some embodiments of the present invention re-estimates the environmental clutter to overcome the clutter changes due to doors or furniture movements.

Feature extractor 240 that processes the “cleaned” echo signals to extract the set of features that will be used to classify the instantaneous state of the monitored human person (e.g., posture, location, motion, movement, breathing, see more details below). The set of the extracted features constructs the feature vector that is the input for the classifier.

Human state classifier 250—The features vector is entered to a Vector Quantization based classifier that classifies the instantaneous features vector by statistically finding the closest pre-trained state out of a set of N possible states, i.e., finding the closest code vector (centroid) out of all code vectors in a codebook 234. The classifier output is the most probable states with its relative probability (local decision).

Cognitive Situation Analysis (CSA) module 260—This unit recognizes whether the monitored person is in an emergency or abnormal situation. This unit is based on a pattern recognition engine (e.g., Hidden Markov Model-HMM, based). The instantaneous states with their probabilities are streamed in and the CSA search for states patterns that are tagged as emergency or abnormal patterns, such as a fall. These predefined patterns are stored in a patterns codebook 234. In case that CSA recognizes such a pattern, it will send an alarm notification to the healthcare center or family care giver through the communication unit (e.g., Wi-Fi or cellular). Two-way voice/video communication unit 150—this unit may be activated by the remote caregiver to communicate with the monitored person when necessary. UWB-RF interferometry unit 220 may include the following blocks: (i) Two-Dimensional UWB antenna array 110-1-110-N to generate the synthetic aperture through all directions, followed by antenna selector. (ii) UWB pulse generator and Tx RF chain to transmit the pulse to the monitored environment UWB Rx chain to receive the echo signals from the antenna array followed by analog to digital converter (ADC). The sampled signals (from each antenna) are stored in the memory, such as SRAM or DRAM.

In order to increase the received SNR, the RF interferometer may repeat the pulse transmission and echo signal reception per each antenna (of the antenna array) and coherently integrate the digital signal to improve the SNR.

Antenna Array and Interferometer

In order to successfully classify the human posture (based on the received echo signals) from any home location, an optimized 2-Dimentional (2D) switched antenna array with a very wide field of view (FOV) was designed to generate the 3-dimentional (3D) back-projection image with a small, or even with a minimal number of antennas. In order to cover the complete home environment, it may be split into several home cells, each with an installed system that detects and tracks the monitored person through the home environment. Coverage and infrastructure consideration may be used to determine the exact system configuration at different home environment. When the monitored person moves from one home cell to another, a pre-defined set of criteria may be used to determine whether to hand-over the human tracking from one cell to another. The width of the antenna array FOV may be configured to reduce the number of home cells while maintain the system's efficiency and reliability.

FIGS. 2A and 2B are high level schematic illustrations of configurations of a linear baseline (SAAA) 110, according to some embodiments of the invention. FIG. 2A schematically illustrates an inline configuration with individual elements separated by D/2 and a staggered configuration with two lines of alternating elements separated by D/2 (on each line elements are separated by D). FIG. 2B schematically illustrates some more details of linear baseline 110. FIG. 2C illustrates a non-limiting example for image resolution data achieved under the parameters defined above, for the various human posture and ranges from system 100, according to some embodiments of the invention. FIG. 2D schematically illustrates the dependency of image resolution on the orientation of the object, according to some embodiments of the invention.

The human posture may be determined by analyzing and classifying the 3-dimentional human image as reconstructed by the back-projection function based on the received echo signals (see above). The image resolution is determined by the interferometer's Down Range (the image resolution in the interferometer's radial direction—ΔR_(dr)) and Cross Range (the image resolution in the interferometer's angular direction—ΔR_(cr)), with ΔR_(dr) determined by the transmitted pulse width and ΔR_(cr) determined by the Antenna Aperture and the range from the interferometer. In order to increase the antenna aperture, a Synthetic Aperture Antenna Array (SAAA) approach may be used by a switched antenna array. Every SAAA is termed herein a Baseline.

The resolutions for SAAA (Baseline) 110 is given by ΔR_(dr)=c/2B.W. and ΔR_(cr)=λR/S.A. with c being the speed of light, B.W. being the pulse bandwidth, λ being the wave length, R being the range from the system's antenna 110, and S.A. being the synthetic aperture. ΔR_(dr) and ΔR_(cr) are selected to ensure that classifier 250 can recognize the human posture. As a non-limiting example, the following parameter ranges may be used: B.W. between 1 and 3 GHz (in a non-limiting example, B.W.=1.5 GHz), λ between 0.03 m and 0.1 m (in a non-limiting example, λ=0.06 m), f between 3 and 9 GHz (in a non-limiting example, f=5 GHz), S.A. between 0.1 m and 0.7 m (in a non-limiting example, S.A.=0.33 m), N_(antennas) between 3 and 21 antennas per baseline (in a non-limiting example, N=12), Antenna spacing between 0.03 m and 0.1 m (in a non-limiting example, 0.03 m) with respect to scene parameters: Ceiling height=2.5 m, sitting person height=1 m, standing person height=1.5 m. Terminated antennas are shown as elements that regulate the operation of the last receiver antennas 110-1 and 110-N in the row.

FIG. 2C presents image downrange and cross-range resolutions with respect to the floor (assuming system 100 is mounted on the ceiling) to a sitting person, a standing person and laying person on floor. The linear baseline may be considered as a switched antenna array in a constant spacing between each antenna element 110-1 . . . 110-N. Specific antenna elements may be selected through a control channel 102 to perform the synthetic aperture.

FIG. 2D schematically illustrates the dependency of image resolution on the orientation of the object, according to some embodiments of the invention. The resolution is illustrated schematically by the size of the rectangles in the figure. As seen in FIG. 2C, the DownRange (DR) resolution is constant (depends on the bandwidth) while the CrossRange (CR) resolution depends on the antenna aperture and on the distance of the human from antenna array 110 of system 100.

FIGS. 2E-2G are high level schematic diagrams illustrating conceptual 2D Synthetic Aperture Antennas arrays 115 with virtual displacements, according to some embodiments of the invention. In FIG. 2E, antenna array system 115 may include several linear arrays of antennas 110A, 110B, 110C and 110D, as a non-limiting example. Each row (linear antenna 110A-D) may have a plurality of receive antennas 110-1 . . . 110-N as explained above; and/or additional transmitting and/or receiving antennas may be part of array 115. As a non-limiting example, one or more Tx antennas 101, 101A-D are illustrated at the central region of array 115. The solid line arrowed X marked 103 in FIG. 2E illustrates the relative shifts of Tx antennas 101A-D with respect to Tx antenna 101.

In FIG. 2F, 2D array structure 115 is shown with four baselines (linear arrays) 110A-D located along sides of a square. Tx antenna(s) 101 may be at the central region of 2D array structure 115. FIG. 2F illustrates schematically the effect of using virtual-displacement Tx Antennas 101A-D as virtual movements of Rx baselines 110A-D in a same displacement vector (step and direction) as the moves from the respective virtual-displacement Tx Antenna 101A-D to the original central Tx Antenna 101. The virtual displacements marked are denoted by broken line arrowed X's marked 113. Virtual displacement of Tx antenna 101 to 101A-D, e.g., by toggling between original central Tx antenna 101 and any of virtual-displacement Tx Antennas 101A-D introduces additional set of echo signals (Scatter) with different Radar Cross Section (RCS) from the target person with different signals' phases as a result of new roundtrip path from transmitting antenna, target, and receiving baselines (antennas arrays). The additional diverse scatter (four additional echo signals sets) improves the reconstructed image in both additional processing gain (target reflection intensity) as well as additional information due to the Tx antenna diversity.

It is emphasized that the indication of the transmitting antenna(s) as antenna elements 101 (and/or 101A-D) and the indication of the receiving baseline(s) as antenna elements 110 (e.g., 110A-D) may be reversed, i.e., antenna elements 101 (and/or 101A-D) may be used as receiving antennas and antenna elements 110 (e.g., 110A-D) may be used as transmitting antennas. System 100 may be configured with receiving antennas 101 and transmitting antennas 110.

In FIG. 2G, 2D array structure 115 is shown with four linear arrays 110A-D located along sides of a square and Tx antenna 101 at the center of the square. Baseline arrays 110A-D may be virtually displaced (marked schematically by the gray arrowed X's) to yield additional virtual baselines 113A-D to improve the back-projection image (see above) by increasing the number of echo signals 99 with additional diversity. Virtual displacements of baseline arrays 110A-110D (FIG. 2G) may be combined with virtual displacements of Tx antenna 101 (FIG. 2F) as well as with non-square positions (FIG. 2E) in any practical configuration to optimize the antenna array configuration with respect to performance, size and cost.

UWB RF interferometer 120 may be to use multiple antennas to implement virtual displacement of the baselines—either multiple antennas 101 are receiving antennas and the virtual displaced baselines 110 are transmitting baselines, or multiple antennas 101 are transmitting antennas and the virtual displaced baselines 110 are receiving baselines.

FIGS. 2H-2J are high level schematic illustrations of linear antennas arrays 115, according to some embodiments of the invention. FIGS. 2K and 2L are simulation results that present the FOV of the array designs, according to some embodiments of the invention. The simulations are electromagnetic simulations at the E-Plane. As shown above, the major requirement from the linear antenna array for home environment is having a large field of view, which becomes a real challenge for a UWB antenna array. An innovated approach of widening the antenna array field of view is presented herein. Exemplary implementations of UWB antenna element 110 illustrated in FIGS. 2H-2J provide Field Of View (FoV) performances that are described in FIG. 2K (for the configuration of FIGS. 2H, 2I) and FIG. 2L (for the configuration of FIG. 2J) for a range of UWB frequencies. FIG. 2J schematically illustrates the addition of (e.g., two) metal beams 114 added along array 110 that widen the FOV, as illustrated in the simulation results in FIG. 2L (compare the wider FOV with respect to FIG. 2K). FIG. 2M shows simulation results that present the VSWR (Voltage Standing Wave Ratio) with and without metal beams 114 (=walls), according to some embodiments of the invention. FIG. 2M illustrates that metal walls 114 improve the antenna's VSWR at the relevant operation UWB band (4-6 GHz) with respect to an antenna lacking walls 114.

In certain embodiments, a BALUN (Balance/Unbalance unit) may be located vertically below the antennas strip (e.g., one or more of baselines 110).

FIGS. 2N and 2O schematically illustrate antenna array 115 with tilted baselines 110A-D, according to some embodiments of the invention. Baselines 110A-D may be tilted from their common plane, e.g., by a tilt angle α 112 ranging e.g., between 10-60°, so that, when antenna array 115 is installed on a ceiling, baselines 110A-D do not face directly downwards but somewhat sideways, by tilt angle α 112. The provided tilt provides a larger field of view of antenna array 115 and hence system 100. An optimization may be carried out involving as parameters e.g., the antenna array unit vertical dimension (enabling the tilt), the field of view of the baselines and the array, and the degree of overlap between different baselines.

FIG. 2P is a high level schematic illustrations of conceptual 2D Synthetic Aperture Antennas arrays 115 providing unambiguous positioning, according to some embodiments of the invention. These embodiments of non-limiting exemplary configurations enable to validate a location of a real target 90A by eliminating the possible images 95A and 95B after checking reflections 99 received at corresponding sub-arrays of antennas 110A and 110D, respectively. It is well understood that these configurations are non-limiting examples and other antennas configurations may be used effectively. Any combinations of embodiments of antenna arrays 115 illustrated herein are also considered part of the present invention. Two-dimensional array 115 guarantees that echo signals 99 are received from any direction around array 115 (assuming that each baseline 110A-D has a field of you of at least 120 degrees), and as shown in the illustration, solves the direction ambiguity of each individual baseline.

FIGS. 2Q and 2R illustrate the coverage of the system's surroundings in the non-limiting case of four baselines 110A-D, according to some embodiments of the invention. In FIG. 2Q, the coverage 117A-D of each baseline 110A-D is illustrated alongside uncovered angular ranges 116A-D. For the sake of clarity, single baseline 110 with coverage angular ranges 117 and uncovered angular ranges 116 is also illustrated. In this schematic non-limiting illustration, coverage angular ranges 117 are considered as being within the primary beam of the baseline (−3 dB), between +60° and −60°. It is noted that wider or narrower definitions may be alternatively used with respect to the baseline and system performance and requirements.

FIG. 2R exemplify possible angular ranges 117A-D in degrees (relating to 360° as the full circle coverage around array 115, i.e., 390°=30°) which cover the whole range around array 115 with overlaps in baseline ranges covered by two baselines. The FoV is defined as the −3 dB points and may be designed to cover 120° (±60°. Baselines 110 may be arranged to cover 360° with respect to array 115 with a certain overlap between baselines 110. Complementarily, baselines 110 may be arranged to solve the human target direction ambiguity by sufficient coverage and overlap requirements. Similar consideration may be taken with respect to either or both primary and secondary beams.

FIG. 2S is a high level schematic illustration of system 100 with two home cells 108A and 108B as a non-limiting example, according to some embodiments of the invention. In some houses/apartments environments 80, PERS system 100 may comprise more than one sub-systems 100A, 100B and/or more than one antenna arrays 115A, 115B to cover whole environment 80 effectively and to monitor target person 90 everywhere in environment 80. For example, home environment 80 may be split into several home cells 80A, 80B, with respective sub-systems 100A, 100B and/or antenna arrays 115A, 115B that create respective sub-cells 108A, 108B. Sub-systems 100A, 100B, etc. may each comprise, e.g., a UWB RF interferometry unit, a human state feature extractor and a human state classifier. Control unit 105 of system 100 regulates (e.g., according to a pre-defined set of criteria) hand-overs between sub-systems 100A, 100B and/or between antenna arrays 115A, 115B as monitored person 90 moves between home cells 108A, 108B, while maintaining continuous detection and tracking. Examples for handing over criteria comprise: (i) BPI_(i)>BPI_(j) with BPI being the back-projection (accumulated) intensity from the monitored person as received at PERS_(i) 100A and PERS_(j) 100B; and/or (ii) PDR_(i)<PDR_(j) with PDR being the person down range distance from PERSi 100A and PERSj 100B as is estimated by each PERS unit. Abnormality situation pattern recognition module 140 of system 100 may be further configured to integrate input from all sub-systems 100A, 100B etc.

The multiple PERS sub-systems may hand-over person tracking among themselves by any of the following exemplary ways: (i) Hard hand-off: Once the handing over criteria are fulfilled by the releasing PERS unit, the person's tracking is moved from the releasing PERS unit which stops the tracking to the receiving PERS unit that starts tracking (break before make); (ii) Soft Hand-off: Once the handing over criteria are fulfilled by the releasing PERS unit, the person's tracking is moved from the releasing PERS unit that keeps tracking the person and sends the information to the receiving PERS unit that starts tracking the person. The realizing PERS unit stops tracking when the receiving PERS acknowledges that it successfully tracks the person (make before break); and (iii) Co-tracking: Each PERS sub-system that sufficiently identifies the person performs the tracking as long as the received scatter signal doesn't decrease below a predefine threshold from the maximum received signal among all the active PERS units. In this mode, the system decision is based on majority based voting between all the PERS units.

Multiple Features Extraction

Multiple features may be extracted y processing unit 225 from received echo signals by interferometer 120. For example, processing unit 225 may be configured to process the received echo signals to derive a spatial distribution of echo sources in the environment using spatial parameters of transmitting and/or receiving antennas 101, 110 respectively, with features extractor 240 being configured to estimate a posture of at least one human 10 by analyzing the spatial distribution with respect to echo intensity, as explained in detail below. For example, processing unit 225 may be configured to cancel environmental clutter by filtering out static non-human related echo signals, process the received echo signals by a back-projection algorithm, and analyze the spatial distribution using curve characteristics of at least two projections of an intensity of the received echo signals onto a vertical axis and at least one horizontal axis, as explained below.

The “cleaned” echo signal vectors may be used as the raw data for the features extraction unit. This unit extracts the features that mostly describe the instantaneous state of the monitored person. The following are examples for the set of the extracted features and the method it's extracted: Position—the position is extracted as the position (in case of 2D—angle/range, in case of 3D—x,y,z coordinates) metrics output of each array baseline. The actual person position at home will be determined as a “finger print” method, i.e., the most proximity to the pre-trained home position matrices (centroids) codebook. Posture—the person posture (sitting, standing, and laying) will be extracted by creating the person “image” by using, e.g., a back-projection algorithm. Both position and posture are extracted, for example, by operating, e.g., the Back-projection algorithm on received echo signals—as acquired from the multiple antennas array in SAR operational mode.

Human Posture

One aspect of the present invention provides a unique human posture sensing and classification system and a breakthrough for the supervision of the elderly instantaneous status during their stay in the house, in general, and extracting features of the human position and posture in particular. The innovated system may be part of the Personal Emergency Response system (PERS) installed in the house's ceiling, and covers a typical elder's apartment (<100 sqm) with a single sensor. The innovated system helps detecting and alerting an emergency situation that might be encountered by elders while being at home. The innovated system may also enable the long term monitoring of elderly activities and other behavioral tendencies during the staying at home.

The following is an outline of the procedure used to find the human position and posture, comprising the following steps: Dividing the surveillance space into voxels (small cubes) in cross range, down range and height; Estimating the reflected EM signal from a specific voxel by the back projection algorithm; Estimating the human position by averaging the coordinates of the human reflecting voxels for each baseline (Synthetic Aperture Antenna Array); Triangulating all baselines' position to generate the human position in the environment; Estimating the human posture by mapping the human related high-power voxels into the form-factor vector; and Tracking the human movements in the environment (bedroom, restroom, etc.).

FIG. 3A is a high level schematic block diagram of system 100 which schematically illustrates modules related to the posture extraction, in accordance with embodiments of the present invention. As explained in detail above, system 100 comprises UWB-RF interferometer 120 associated with antenna array 115 and delivering the received echo signals to home environment clutter cancelation 230. The echo signals are then delivered to a pre-processor 302, a human posture image back-projection reconstruction module 310, possibly with floor enhancement 312 and projection on the x, y, z axes 315 and finally to posture features extractor 240A (as part of feature extractor 240) and consequently to posture classifier 250A (as part of classifier 250) which derived a classified posture 317, possibly using codebook 234.

Environmental clutter cancelation 230 may be configured to remove the unwanted echo components that are reflected from the apartment's static items as walls, door, furniture etc. The clutter cancelation may be carried out by subtracting the unwanted environmental clutter from the received echo signals. The residual clutter (scatter) represents the reflected echo signals from the monitored human body. System 100 may be configured to estimate (e.g., implementing a learning algorithm) the environmental clutter (to be cancelled) when there is no person at the environment, e.g., the person is not at home, or is not at the estimated zone, and use the estimated clutter for clutter cancellation 230. Environmental clutter cancelation module 230 may comprise a stationary environment detector that decided when the unit may re-estimate the environmental clutter, possibly with an addition manual control to perform the initial estimation during the system setup.

FIG. 3B is a high level schematic block diagram of the operations performed by preprocessing unit 302, in accordance with embodiments of the present invention. Preprocessing unit 302 may be configured to perform the following blocks for each of the received echo (fast) signals: DC removal 302A by continuously estimating the DC signal (time varying DC). The estimated DC signal is subtracted from the original signal. Gain mismatch correction 302B may be performed to compensate for the path loss differences among each of the interferometer's antennas received fast signals. Phase mismatch correction 302C may be performed to compensate for the time delay among the fast signals. An out of band (O.O.B.) noise reduction filter 302D (matched filter) may be configured to filter out the out of pulse bandwidth noise and interferences.

Monitored the person's posture (e.g., sitting, standing, and laying) may be extracted (240A) by creating the person's low resolution “image”, corresponding to a spatial distribution of echo sources, by using back-projection algorithm 310. For example, position and posture may be extracted by operating back-projection algorithm 310 on received echo signals as acquired from the multiple antennas array in Synthetic Aperture Antenna Array (SAAA) operational mode, illustrated in FIG. 2E.

For example, 3D back-projection may be formulated as indicated in Equations 1, by defining the locations of a J-transmitting antenna elements as the transmitting array (e.g., either of antennas 101 or antennas 110) and a K-receiving antenna elements as the receiving array (e.g., the other one of antennas 101 or antennas 110), expressing the received fast signals denoted J·K and deriving the absolute image value I(V_(m)) using the confocal microwave imaging algorithm, applied to any selected voxel V_(m) in the region of interest.

J-transmitting antenna elements (transmitting array) located at—{x_(tj), y_(tj), z_(tj)}_(j=1) ^(J)

-   -   K—receiving antenna elements (receiving array) located         at—{x_(rk),y_(rk), z_(rk)}_(k=1) ^(K)     -   J·K received fast signals are—{{s_(j,k)(t)}_(j=1) ^(J)}_(k=1)         ^(K) where 0≦t≦T.

$\begin{matrix} {{{{Voxel}\mspace{14mu} V_{m}} = \left( {x_{m},y_{m},z_{m}} \right)}{{I\left( V_{m} \right)} = {{\sum\limits_{j = 1}^{J}\;{\sum\limits_{k = 1}^{K}\;{{s_{j,k}\left( {t_{j,k}\left( V_{m} \right)} \right)}{\mathbb{e}}^{{j\varphi}_{j,k}{(V_{m})}}}}}}}} & {{Equations}\mspace{20mu} 1} \end{matrix}$

The summation is over all the received fast signals S_(j,k) (t_(j,k)(V_(m))), and it contains the reflections equivalent to the round-trip, which is the total distance t_(j,k)(V_(m)) from each of the transmitting antennas to the specific voxel V_(m) and the distance from this specific voxel V_(m) to each of the receiving antennas, as calculated in Equations 2 in terms of the coordinates of the transmitting and receiving arrays. The phase φ_(j,k)(V_(m)) is also calculated as presented in Equations 2. c denotes the speed of light and f_(c) denotes the central frequency.

                                 Equations  2 ${t_{j,k}\left( V_{m} \right)} = \frac{{l_{j,m}\left( V_{m} \right)} + {l_{m,k}\left( V_{m} \right)}}{c}$ ${{l_{j,m}\left( V_{m} \right)} = \sqrt{\left( {x_{tj} - x_{m}} \right)^{2} + \left( {y_{tj} - y_{m}} \right)^{2} + \left( {z_{tj} - z_{m}} \right)^{2}}}\;$ ${{l_{m,k}\left( V_{m} \right)} = \sqrt{\left( {x_{m} - x_{rk}} \right)^{2} + \left( {y_{m} - y_{rk}} \right)^{2} + \left( {z_{m} - z_{rk}} \right)^{2}}}\;$ ${\varphi_{j,k}\left( V_{m} \right)} = {\frac{2\pi\; f_{c}}{c}\left( {{l_{j,m}\left( V_{m} \right)} + {l_{m,k}\left( V_{m} \right)}} \right)}$

The image, expressing the spatial distribution of the echo sources, may be reconstructed from the absolute image values I(V_(m)) by computing them for all the voxels in the region of interest Ω, i.e., I(V_(m))=I(x_(m),y_(m),z_(m)), x_(m)εΩ_(x), y_(m)εΩ_(y), z_(m)εΩ_(z). This derived image is denoted in the following the “Coherent Image”, as it is a coherent accumulation of the fast signals' intensity contributions from the Rx antennas. A “Partially Coherent Image”, which is a more sophisticatedly-derived spatial distribution of the echo sources, may be derived from several “2D Coherent Images” which are each reconstructed from a subset of fast signals, and are then multiplied by each other, as illustrated in Equations 3. Equations 3 relate as a non-limiting example to a single transmitting antenna (J=1) and 32 receiving antennas (K=32) in four subsets (Baselines—BL). (e.g., corresponding to central transmitting antenna 101 and receiving baseline 110). “Partially Coherent Image”: I(V _(m))=Π_(i=1) ⁴ I _(i)(V _(m)) “Coherent Images” (from subsets 1≦i≦4): I _(i)(V _(m))=Σ_(j=1) ^(i)Σ_(k=1) ⁸ s _(j,k)(t _(j,k)(V _(m)))e ^(jφ) ^(j,k) ^((v) ^(m) ⁾| subset 1: BL₁ ={s _(1,1)(t),s _(1,2)(t), . . . , s _(1,8)(t)} subset 2: BL₂ ={s _(1,9)(t),s _(1,10)(t), . . . , s _(1,16)(t)} subset 3: BL₃ ={s _(1,17)(t),s _(1,18)(t), . . . , s _(1,24)(t)} subset 4: BL₄ ={s _(1,25)(t),s _(1,26)(t), . . . , s _(1,32)(t)}   Equations 3

FIGS. 3C and 3D are illustrative examples for partially coherent images, according to some embodiments of the invention. FIGS. 3C and 3D are partially coherent images of a standing person and a laying person, respectively. As seen in FIGS. 3C and 3D, the echo sources are detected as a spatial distribution with a spatial resolution depending on the sizes of the voxels. The echo sources may be characterized, e.g., in terms of human postures, according to the calculated and processed spatial distribution. High power voxels may be defined by a specified power threshold, and used, possibly enhanced, to derive the posture features.

Floor enhancement module 312 is configured to compute a floor enhancement 3D image, denoted I (X,Y,Z), from the Back-Projection 3D image generated by module 310. In the floor enhancement image the intensity is increased in the region of interest, e.g., the lower part of the 3D image that corresponds to the floor. In the process, the 3D image is divided into e.g., three levels: the lower cube level (floor region), the intermediate (transition) region, and the upper level. For example, floor enhancement may be implemented multiplying the voxel intensity of floor region voxels by a factor greater than one, not altering the upper level voxels, and multiplying the intermediate (transition) region voxels by a smoothing function, such as the function exemplified, in a non-limiting manner, in Equation 4, with MaxWeight being the multiplication factor for floor region voxels and z being the height above the floor.

                                      Equation  4 ${{FloorEnhancementFunction}\mspace{14mu}(z)} = \left\{ \begin{matrix} {{MaxWeight},} & {z < {50\lbrack{cm}\rbrack}} \\ {{\frac{{MaxWeight} - 1}{100 - 50}z},} & {{50\lbrack{cm}\rbrack} \leq z \leq {100\lbrack{cm}\rbrack}} \\ {1,} & {z > {100\lbrack{cm}\rbrack}} \end{matrix} \right.$

Module 315 is configured to perform 3D image projection on the x, y, z axes, e.g., of the floor enhancement 3D image, by compressing the 3D image into three 1D signals for convenient processing. For this purpose, the projection of I(X, Y, Z) on axes x, y and z, denoted P_(x), P_(y) and P_(z), may be computed according to Equations 5. It is noted that one or more projection axis may be used, e.g., a vertical axis and one or more horizontal axes.

$\begin{matrix} {{P_{x}\overset{\bigtriangleup}{=}{{P\left( {X = x} \right)} = {\sum\limits_{y}{\sum\limits_{z}{{I\left( {{X = x},{Y = y},{Z = z}} \right)}\mspace{14mu}{\forall{x \in \Omega_{x}}}}}}}}{P_{y}\overset{\bigtriangleup}{=}{{P\left( {Y = y} \right)} = {\sum\limits_{x}{\sum\limits_{z}{{I\left( {{X = x},{Y = y},{Z = z}} \right)}\mspace{14mu}{\forall{y \in \Omega_{y}}}}}}}}{P_{z}\overset{\bigtriangleup}{=}{{P\left( {Z = z} \right)} = {\sum\limits_{x}{\sum\limits_{y}{{I\left( {{X = x},{Y = y},{Z = z}} \right)}\mspace{14mu}{\forall{z \in \Omega_{z}}}}}}}}} & {{Equation}\mspace{14mu} 5} \end{matrix}$

FIGS. 3E and 3F are illustrative examples for computed projections on the x, y and z axes of 3D images of a person standing and laying, respectively, in front of the sensor according to some embodiments of the invention. It is noted, e.g., that the z projection for the standing person image (FIG. 3E) is typically different than the z projection for the laying person image (FIG. 3F).

Various features may be computed for the three projections, P_(i) (i=x, y or z), such as: Standard Deviation(Pi), Kurtosis(Pi), Skewness(Pi), Max(Pi), Argmax(Pi), Min(Pi), Argmin(Pi), RightPosition(Pi), LeftPosition(Pi), Width(Pi), and so forth. The first three features are statistical characteristics of the curves, namely their second, third and fourth standardized moments (centered moments) defined in Equations 6, with p_(i) denoting the projections, x_(i) denoting the respective axis points (p_(i)=P_(d)(x_(i)) with dε{X, Y, Z} and x_(i)εΩ_(x)) and p denoting the average of p_(i) with N_(d) denoting the total samples per axis, i.e. N_(x), N_(y)or N_(z). FIG. 3G illustrates schematically seven features on a schematic curve representing an arbitrary projection. The features relating to the right and left of the curve may be defined as being at an intensity below a specified threshold with respect to the maximum, e.g., the threshold being between 5%-15% of the maximal intensity. The shorthand “arg” refers to the respective argument (axis value) and the width is defined between the right and left positions.

$\begin{matrix} {{\overset{\_}{p} = {\frac{1}{N_{d}}{\sum\limits_{i = 1}^{N_{d}}\; p_{i}}}}{{{Std}\;\left( P_{d} \right)} = \sqrt{\frac{1}{N_{d}}{\sum\limits_{i = 1}^{N_{d}}\;\left( {p_{i}^{2} - \overset{\_}{p}} \right)}}}{{{Skewness}\left( P_{d} \right)} = \frac{\frac{1}{N_{d}}{\sum\limits_{i = 1}^{N_{d}}\;\left( {p_{i} - \overset{\_}{p}} \right)^{3}}}{{Std}\;\left( P_{d} \right)^{3}}}{{{Kurtosis}\left( P_{d} \right)} = \frac{\frac{1}{N_{d}}{\sum\limits_{i = 1}^{N_{d}}\;\left( {p_{i} - \overset{\_}{p}} \right)^{4}}}{{Std}\;\left( P_{d} \right)^{4}}}} & {{Equations}\mspace{14mu} 6} \end{matrix}$

FIG. 3H is an illustration of an exemplary spanning of the features' space by two of the features described above, according to some embodiments of the invention. The features are seen to correlate with respect to different posture of the person, such as standing, sitting and laying.

Returning to FIG. 3A, posture classifier 250A receives the extracted features vector from posture features extractor 240A, the posture features vector comprising the selected set of the features that were extracted from the projections Px, Py and Pz. Classifier 250A is configured to determine whether the person is in a standing, sitting or laying posture, for example according to the following example.

A set of all the possible postures is defined as {tilde over (C)}={posture₁, posture₂, . . . , posture_(c)} with c denoting the total number of postures, for example, {tilde over (C)} may be a set of postures: {standing, sitting, laying}. X_(posture) _(i) is defined as the set of template features vectors attributed to posture_(i) and is used to train the classifier and creating the codebook, as illustrated in FIG. 3I which is a schematic block diagram of a training module 251 in posture classifier 250A, according to some embodiments of the invention. The training phase of classifier 250A may comprise preprocessing 251A, configured to scale each feature, e.g., to have the variance in each of the features as a known constant and then, using a training process 251B, creating codebook 234 (used later for the actual classification) of code-vectors, which projects the complete set of the various features vectors into a smaller subset. Training process 251B may be implemented by various methodologies, of which two are exemplified in the following in a non-limiting manner. One example is a ‘Supervised Vector Quantization (VQ)’, in which codebooks 234 are created according to the number of postures, e.g., for C=3 postures, K centroids (centroids are the centers of distributions according to a given measure, for example n-dimensional means) may be defined per posture, resulting in 3·K centroids denoted as {{μ_(k,c)}_(k=1) ^(K)}_(c=1) ³. Another example is a ‘One Codebook VQ’, in which one codebook is created for all the postures' feature vectors, without posture distinction. For example, K centroids may be defined for all the postures as {μ_(k)}k₌₁ ^(K). Moreover, for each centroid the internal distribution for each posture, denoted as the conditional probability of a posture given the centroid—P(posture_(i)|centroid_(j)), may be determined. The prior matrix, expressed in Equation 7, is defined as having rows that correspond to the centroids and columns that correspond to the probability of each posture given this centroid. The classifying phase (250A) depends on the selected training methodology and resulting codebook(s) 234, as exemplified below.

$\begin{matrix} {{PriorMatrix} = {\quad\left\lbrack \begin{matrix} {P\left( {posture}_{1} \middle| {centroid}_{1} \right)} & \ldots & {P\left( {posture}_{C} \middle| {centroid}_{1} \right)} \\ \vdots & \ddots & \vdots \\ {P\left( {posture}_{1} \middle| {centroid}_{K} \right)} & \ldots & {P\left( {posture}_{C} \middle| {centroid}_{K} \right)} \end{matrix} \right\rbrack}} & {{Equation}\mspace{14mu} 7} \end{matrix}$

FIG. 3J is a schematic block diagram of a classifying stage 252 in posture classifier 250A, according to some embodiments of the invention. In classifying phase 252, new feature vectors are entered into a preprocessing unit 252A and posture classifier 250A computes the best posture fit out of all postures represented in codebook(s) 234, e.g., by using a pre-defined cost function 252B. For example, cost function 252B of the ‘Supervised VQ’ methodology may be defined as the minimum distance across all the centroids. The classified posture is the posture attributed to the minimum distance centroid. In the second example, cost function 252B of the ‘One Codebook VQ’ methodology may be defined as in Equation 8, relating to the definitions of Equation 7, with x being the tested feature vector, P(x|centroid_(j)) calculated using the normal distribution N(x|μ_(j),Σ_(j)) and P(centriod_(j)) estimated using the total vectors attributed to each of the centroids, {circumflex over (i)}=argmax_(i) P(posture_(i) |x)=argmax_(i)[Σ_(j) P(posture_(i),centroid_(j) |x)]=argmax_(i)[Σ_(i) P(posture_(i)|centroid_(j) ,x)P(centroid_(j) |x)]≡argmax_(i)[Σ_(i) P(posture_(i)|centroid_(j))P(centroid_(j) |x)]=argmax_(i)[Σ_(i) P(posture_(i)|centroid_(j))P(x|centroid_(j))P(centriod_(j))]   Equation 8

Alternatively or complementarily, Support Vector Machine (SVM) classification may be implemented as posture classifier 250A, in which the features vectors are represented as linear lines that are formulated as a set of cost functions. An unknown test vector is evaluated by these cost functions and the classification is determined according its results.

Human Motion

Human motion—The monitored human body may create vibrations and other motions (such as gestures and gait). Therefore, it introduces frequency modulation on the returned echo signal. The modulation due to these motions is referred to as micro-Doppler (m-D) phenomena. The human body's motion feature may be extracted by estimating the micro-Doppler frequency shift vector at the target distance from the system (down range). The following description and FIGS. 4A-4N elaborate on the aspect of human motion features extraction.

It is noted that the term “motion” refers to the motion of the body and/or of body parts without displacement of the whole body as a bulk, such as gestures, limb motions, posture changes such as sitting down or standing up, gait (separated from the displacement), motion suddenness (e.g., possible fall or collapse), etc. The term “movement” refers to the displacement of a person's body as a whole, irrespective of the motion of body parts such as the limbs (in case of movement detection by backpropagation algorithms, the movement may comprise only the radial components of displacement).

Non-wearable monitoring system 100 may comprise ultra-wide band (UWB) radio frequency (RF) interferometer 120 configured to transmit UWB RF signals at, and to receive echo signals from, an environment including at least one human, processing unit 225 configured to processing derive, e.g., at a slow signal derivation module 226, a range-bin-based slow signal from the received echo signals, the slow signal being spatio-temporally characterized over a plurality of spatial range bins and a plurality of temporal sub-frames, respectively, and feature extractor 240 configured to derive from the slow signal a Doppler signature and a range-time energy signature as motion characteristics of the at least one human.

The Doppler signature may be derived by comparing spectral signatures of sub-frames in the slow signals, which are related to identify human-related range bins and sub-frames. The energy signature may derived by evaluating powers of the slow signal at identified human-related range bins and sub-frames. The Doppler signature and/or the energy signature may be derived with respect to different body parts of the at least one human.

Feature extractor 240 may be further configured to derive location data to yield movement characteristics of the at least one human. The location data may be derived by detecting displacements of the at least one human using back-projection and/or by identifying human-related range bins and sub-frames in the slow signal. The derivation of the location data may be carried out using a spatio-temporal histogram of the range-time energy signature, by identifying on the histogram range changes of at least body parts of the at least one human.

System 100 may further comprise human state classifier 250 configured to classify the motion and movement characteristics of the at least one human to indicate a state of the at least one human, and abnormality situation pattern recognition module 262, e.g., as part of cognitive situation analysis module 260 configured to generate an alert once the indicated state is related to at least one specified emergency. The classification may carried out by identification of a most probable fit of one of a plurality of predefined states to the motion and movement characteristics and wherein the alert generation is based on pattern recognition with respect to previously indicated states.

FIG. 4A is a high-level schematic flowchart illustration of exemplary human motion features extraction 241 in feature extractor 240, according to some embodiments of the invention. The Human Motion Features Extractor system receives a UWB echo signal 401 and processes it according to the following blocks. Detailed descriptions of modules in FIG. 4A are presented in consecutive figures.

Echo (fast) signal preprocessing unit 405 receives the echo signals from antennas 110-1 to 110-N. Each pulse transmission is represented by a vector that is referred to in the following as the ‘fast time signal’. The transmission-reception cycle is performed repeatedly for a frame of, e.g., T_(frame)=2 to 5 seconds at a rate of, e.g., F_(slow)=100 Hz to 300 Hz as non-limiting values. The output of unit 405 is a matrix of the received echo signals, where each row is a fast time signal of a different transmission.

Range bin based slow signal constructor (Fast2Slow) 410 rearranges the downrange echo (fast) signals vectors (the matrix rows) to represent the cross-range (slow) signals 411 (the matrix columns), as illustrated in FIG. 4B below. The slow signal vector represents a single downrange distance (bin) with a sampling rate, e.g., F_(slow)=100 Hz to 300 Hz as a non-limiting value. These vectors are referred as the ‘slow time signals’.

Human body (target) detection is carried out by detecting its representation by a range bins window of e.g., RW_(target)=50 to 200 range bins (assuming, in a non-limiting manner, that each range bin is approximately 1 cm), in a non-limiting example. The target location may be determined by the range bins window with the highest motion power among all of the RW_(target) bins windows. The slow signal may be preprocessed for each range bin separately and may include DC removal, which is done by the subtraction of the estimated average DC signal from the original signal as well as other optional signal adjustments for example gain and phase mismatch correction between all the range bins slow signals and out-of-band noise reduction filtering.

Feature extraction 241 may be separated into two components—motion Doppler characteristics derivation 420A (motion Doppler features) and motion change over range bins and time characteristics derivation 420B (motion energy features). Motion features extraction 241 yields a motion features vector 440 which is then used for further processing and classification in classifiers 130 and/or 250. The following demonstrates in a non-limiting manner possible embodiments of derivations 420A, 420B.

Motion characteristics detection 420 may comprise deriving from the slow signal a Doppler signature, e.g., by block 420A, and a range-time energy signature, e.g., by block 420B, as motion characteristics of the at least one human

Motion characteristics detection 420 may comprise, concerning derivation of Doppler signature 420A, Doppler preprocessing and segmentation 422 in which the slow signal frame is divided into M_(subframes) sub-frames using Equation 13 (see below). The spectrogram may be generated by fast Fourier transform (FFT) for each slow time signal sub-frame within the human target range. A maximal Doppler frequency extractor 424 may use the maximum Doppler frequency to identify the instantaneous moment and range that a rapid motion (such as falling) has occurred. This feature is extracted by scanning all the slow time signal sub-frames per each range bin and accumulating the related power spectrum with the highest motion (Doppler) frequency that is selected out of each range bin. The maximal Doppler feature is extracted from the accumulated range bins power spectrums. A Motion Energy Extractor 426 may estimate the motion energy features in the frequency domain There are a few features that are extracted to better represent the overall motion energy.

Motion characteristics detection 420 may comprise, concerning derivation of energy signature 420B, Range over Time preprocessing and segmentation 432 in which the signal is preprocessed and segmentation of the data into histograms is performed. For example, at a first stage, a Dynamic Time Wrapping (DTW) process may be implemented to estimate the human motion path along the range bins window and at a second stage, e.g., three histograms, which contain information about the distribution of the motion activity and energy signature over range, are generated to represent: (i) Cumulated energy of all the range bins selected; (ii) The numbers of appearances of each range bin in the top 5 range bins; and (iii) The number of average energy for each range bin that appeared in the top 5 ranges bins list. For each histogram, a set of features may be extracted to represent the histogram form factor, for example: (i) Motion energy distribution analysis 434 which comprises the extraction of features that represent the distribution of the energy over the range bins, carried out e.g., by using the energy distribution histogram analysis over range bins; (ii) Motion over range distribution analysis 436 to represent the distribution of the active range bins during the motion period and helps determine if the motion is stationary in space or distributed among several range bins; and (iii) Motion route energy estimator 438 which extracts the motion route energy by accumulating the power over the motion path (the selected range bins power as a result of the DTW at the pre-processing unit).

FIG. 4B is a high-level schematic illustration of fast and slow signal mapping 410, 411, according to some embodiments of the invention. The received preprocessed fast signals are mapped in a two dimensional matrix X (Equation 9). Each echo sample is an element on the matrix [n][k]; n=1 . . . N_(Ranges); k=1 . . . K_(Sampels), where n is the downrange bin indicator of spatial range bin, and k is the cross-range (slow) time indicator of temporal bins. The number of total range bins is determined by the scanning window, while each range bin represents C/F_(fast) meters (F_(fast) is the echo signal sampling rate). The matrix is separated into its rows. Each row x_(n) is the echo signal from the same range from the interferometer (radar), sampled in F_(slow)=250 Hz. Those vectors are referred as the slow time signals.

$\begin{matrix} {{X\left( {{x\lbrack n\rbrack}\lbrack k\rbrack} \right)} = \begin{bmatrix} {{x\lbrack 1\rbrack}\lbrack 1\rbrack} & \ldots & {{x\lbrack 1\rbrack}\lbrack K\rbrack} \\ \vdots & \ddots & \vdots \\ {{x\lbrack N\rbrack}\lbrack 1\rbrack} & \ldots & {{x\lbrack N\rbrack}\lbrack K\rbrack} \end{bmatrix}} & {{Equation}\mspace{14mu} 9} \end{matrix}$

FIG. 4C is a high-level schematic flowchart illustration of exemplary human body target detection 452, according to some embodiments of the invention. Human Body Target Detection unit 452 narrows the focus of the analysis to the relevant range bins with human presence. Unit 452 may operate with various inputs, according to the required features to be extracted. The process of the target detection given the slow time signals of all the N range bins is performed by the following blocks, as an exemplary embodiment. A range bin power calculator 452A calculates the power of each slow time vector by Equation 10, where k and n are the time and range bin indicators respectively, to yield N power values. p[n]=Σ _(k=1) ^(K) x _(n) ² [k] for n=1 . . . N _(Ranges)  Equation 10 Following, the power sequence over a sliding window of RW_(target) range bins is calculated along the (N_(Ranges)−RW_(target)+1) windows (Eq. 2.2) and accumulated by accumulator 452B, according to Equation 11. s[n]=Σ _(j=0) ^(M-1) p[j+n] for n=1 . . . (N _(Ranges)−RW_(target)+1)  Equation 11 Finally, the human target location region is detected 452C and indicated at the most powerful windowed power as expressed in Equation 12. Windicator=argmax_(n)(s[n])  Equation 12

FIG. 4D is a high-level schematic flowchart illustration of an exemplary slow signal preprocessing unit 454, according to some embodiments of the invention. It is noted that FIG. 4D is similar to FIG. 3B presented above, and is repeated here to maintain the flow of explanation in the present context. The signal processing itself may be similar or differ in details with respect to posture features extraction. The slow time signal preprocessing may be carried out in a generic unit having its input determined by the extracted features (e.g., of features vector 440) and optionally operating on each slow time signal separately. Preprocessing unit 454 may perform the following blocks: (i) Adaptive DC removal 454A by continuously calculating the estimated DC signal (time varying DC) for each time bin by Equation 13, using the current slow signal vector x[k], s[k]=(1−a)s[k−1]+ax[k],k=1 . . . K _(Samples)  Equation 13 where α, is the learning coefficient. The estimated DC signal is subtracted from the original signal, namely y[k]=x[k]−s[k]. Gain mismatch correction 454B may optionally be performed to the selected range bins' slow signals to compensate the path losses differences among the selected range bins. The additional path loss of R_(i) versus R_(min) may be calculated as

${{\Delta\;{P.L.\lbrack{dB}\rbrack}} = {20{\log\left( \frac{R_{i}}{R_{\min}} \right)}}},$ where R_(i) is the range bin i distance out of the selected set of range bins and R_(min) is the first (closest) range bin. A slow signal phase mismatch correction 454C among the selected range bins may be carried out to compensate for the motion offset over the time/range bin. That is, the same motion profile may be preserved between neighbor range bins with a delayed version. The slow signal phase mismatch correction may estimate the phase error between SlowSig_(Ri) and SlowSig_(Rref), where SlowSig_(Ri) is the slow signal of range bin R_(i), and SlowSig_(Rref) is the slow signal that is considered the reference range bin out of the selected range bins. Optionally, an out of band (O.O.B.) noise reduction filter 454D may be enabled to filter out the irrelevant slow signal components or interferences that might influence the performance of the various energy based features extraction.

FIG. 4E is a high-level schematic flowchart illustration of exemplary Doppler preprocessing and segmentation 422, according to some embodiments of the invention. A spectrogram 422A for each range bin may be generated and used for extraction of signal's spectral features, for every short time period, termed herein sub-frame (e.g., a plurality of specific fast signal times, i.e., a range of k values). The sub-frame period should be short enough to consider the motion as stationary). In order for a spectrogram to be created from a slow time signal vector of a specific range bin in the target region, slow time signal 411 of each range bin is first preprocessed by a preprocessing unit 422B, and then a human motion target detection unit 422C is being used to find the target range bin. Spectrogram 422A of each range bin is generated by segmenting the original signal x[k] to M_(subframes) sub-vectors. For a given window size and number of overlaps, a new vectors group is constructed according to Equation 14. {v _(m) [i] }={x[L _(step) m+i]};i=1 . . . D; m=1 . . . M _(subframe)  Equation 14 Each vector may have, as a non-limiting example, D_(winSize) may be between 50 and 200 samples (equivalent a subframe length of between 0.15 and 2 seconds) with overlaps of (D_(Winsize)−L_(step)) samples from the previous vector in the sequence (L_(step)=is the samples step size between subframes). Then, a power spectrum V_(m) may be computed for each sub-frame by Equation 15, where h is a hamming window with length D. {V _(m) }=FFT{v _(m) ·h}; m=1 . . . M _(subframes)  Equation 15 This process is repeated for every range bin within the target region. RW_(target) spectrograms 422D are gathered for further processing.

FIG. 4F is a high-level schematic flowchart illustration of an exemplary maximal Doppler frequency extraction 424A, according to some embodiments of the invention. Maximal Doppler frequency extractor 424 is configured to find the highest velocity of the motion, which is represented by the motion's Doppler frequency along the motion's down-range route. The timing of the human peak motion activity is not common for all range bins, due to the fact that the motions can cross range bins versus the time. Therefore, the maximal Doppler frequency feature is extracted by scanning all the slow time signal sub-frames per each range bin and accumulating the related power spectrum with the highest motion (Doppler) frequency that is selected out of each range bin. The max Doppler feature may be extracted from the accumulated range bins power spectrums. In order to extract the action power spectrum by extractor 424B from each range bin spectrogram, the following process is performed: Noise level threshold estimation computation 424C calculates the noise level threshold for the spectrogram energy by considering the spectrogram values below noise level are considered as not related to human motion. A threshold T₁ (measured in dB) may be determined by Equation 16, using the mean of the upper four frequency bins of the spectrogram, while Q, P are respectively the numbers of frequency and time bins of the spectrogram matrix S.

$\begin{matrix} {{T_{1} = {\frac{1}{4}{\sum\limits_{q = {({Q - 4})}}^{q = Q}{\frac{1}{P}{\sum\limits_{p = 1}^{p = P}{s\left\lbrack {p,q} \right\rbrack}}}}}},{s \in S}} & {{Equation}\mspace{14mu} 16} \end{matrix}$

The maximal motion frequency bin is defined and estimated in 424D, as the first frequency bin to its power below the motion threshold when scanning the spectrum from F_(min) to F_(max) which is the motion (active) region for the p power spectrum as defined by Equation 17. f _(p)=argmin_(q)(s[q,p]<(T ₁+1)) for p=1 . . . P  Equation 17 where f_(p) is the maximal frequency at the p power spectrum that its power is <T₁+1 dB. An example for that region from a full spectrogram can be seen in spectrogram 424E of FIG. 4G.

FIG. 4G is an exemplary illustration of a spectrogram 424E of motion over a single range bin in the active area, according to some embodiments of the invention. Action power spectrum extractor 424B further carries out a selection 424F of the power spectrum with the highest frequency—the selected power spectrum at time bin p is the one that has the highest value of f_(p) (referred as action power spectrum of range q). This power spectrum is extracted for farther analysis. The averaged action power spectrum P_(av) is created (424G) using action power spectrums 424E from all range bins. Then, a new noise threshold T₂ is calculated from Equation 18, by using the average value of the upper four frequency bins of the averaged (accumulated) power spectrums, in a non-limiting example.

$\begin{matrix} {T_{2} = {\frac{1}{4}{\sum\limits_{q = {({Q - 4})}}^{q = Q}{P_{av}\lbrack q\rbrack}}}} & {{Equation}\mspace{14mu} 18} \end{matrix}$ The maximal frequency feature is calculated by Equation 19: f _(max)=argmin_(q)(P _(av) [q]<(T ₂+1))  Equation 19

FIG. 4H is a high-level schematic flowchart illustration of an exemplary motion energy features extractor 426, according to some embodiments of the invention. The motion energy features may be estimated in the frequency domain. There are a few features that are extracted to better represent the overall motion energy. The motion energy might be affected by several conditions which are not related to the motion itself. For example, the relative distance from the interferometer as well as the motion duration. Unit 426 may create several spectrograms for all the target range bins to extract the various features that represent the energy signature.

The motion energy features may be extracted by the following exemplary process. Two spectrogram versions may be created for each target range bin. The first spectrogram may be created after a range gain mismatch correction (to compensate the path loss variations over the range bins). The other spectrogram may be created without the gain mismatch correction (426A). The gain mismatch may be implemented at preprocessing unit 422B. Therefore, two spectrogram sets are created for the complete range bins {S_(1n)} and {S_(2n)}. For each set of spectrograms, an average spectrogram 426B S_(1av) S_(2av) may be created by Equation 20.

$\begin{matrix} {{{{{S_{i,{av}}\lbrack q\rbrack}\lbrack m\rbrack} = {\frac{1}{RW}{\sum\limits_{n = 1}^{RW}{{S_{n}\lbrack q\rbrack}\lbrack m\rbrack}}}};{{{for}\mspace{14mu} i} = 1}},{{2m} = {1\mspace{14mu}\ldots\mspace{14mu} M_{subframes}}},{q = {1\mspace{14mu}\ldots\mspace{14mu} Q_{freqbins}}}} & {{Equation}\mspace{14mu} 20} \end{matrix}$ In order to emphasize the motion power in higher frequencies, each averaged spectrogram frequency bin {right arrow over (S)}_(av) ^(q) may be processed with a corresponding weight, into a new weight-averaged spectrogram 426C by Equation 21.

$\begin{matrix} {{{\overset{\rightarrow}{SW}}_{av}^{q} = {{\overset{\rightarrow}{S}}_{av}^{q}*\sqrt{\frac{f\lbrack q\rbrack}{f_{\max}}}}};{{{for}\mspace{14mu} q} = {1\mspace{14mu}\ldots\mspace{14mu} Q_{freqbins}}}} & {{Equation}\mspace{14mu} 21} \end{matrix}$ where f[q] is the frequency value of the q frequency bin, and f_(max) is the maximal frequency bin value. Two vectors of the power peaks may be created (426D) for each the two spectrograms, with and without power correction. A first vector {right arrow over (p)}₁ contains the maximal power of each sub-frame vector {right arrow over (s)}_(av) ^(m) (Equation 22A), and the second vector {right arrow over (p)}₂ contains the maximal values of each frequency bin vector {right arrow over (s)}_(av) ^(q) (Equation 22B). p ₁ [m]=max({right arrow over (s)} _(av) ^(m)); for m=1 . . . M _(subframes)  Equation 22A p ₂ [m]=max({right arrow over (s)} _(av) ^(q)); for m=1 . . . Q _(subframes)  Equation 22B Each of the four (2×2) vectors—with the different procedures for gain processing and for maximal power values extraction, are accumulated into four energy features.

FIG. 4I is a high-level schematic flowchart illustration of an exemplary range-time preprocessing and segmentation flow 432 as part of derivation of energy signature 420B, according to some embodiments of the invention. The motion features over range-time helps profiling the energy signature of each motion, not only by characterizing its power and velocity, but by also characterizing its distribution over space along the motion time. Module 432 may be configured to create three histograms that express the distribution of motion energy and activity over the range bins during motion period. The energy related histograms may be created by the following algorithms. After normalization of the slow time matrix X (defined in Equation 9) using the highest absolute value amplitude as

$X = \frac{X}{{X}_{\infty}}$ (the notation X is maintained for simplicity), the target region is located by a target region detector unit 432A. The two axis of the slow time matrix X correspond to the time of the sample (time bin) and the range of the sample (range bin). Each range bin vector, with K_(samples)=T_(frame) F_(slow) length as an example, may then segmented into 10 sub-frames as a non-limiting example and mapped in as new matrix X_(n) defined in Equation 23, with K_(Samples)=800 as a non-limiting example, with each row of the new matrix having K_(Sample)/10 samples with an overlap.

$\begin{matrix} {X_{n} = \begin{bmatrix} {x_{n}\lbrack 1\rbrack} & \ldots & {x_{n}\lbrack 80\rbrack} \\ \vdots & \ddots & \vdots \\ {x_{n}\lbrack 720\rbrack} & \ldots & {x_{n}\lbrack 800\rbrack} \end{bmatrix}} & {{Equation}\mspace{14mu} 23} \end{matrix}$ With E_(n) being the temporal energy vector of each range bin as calculated in Equation 24, j being the sub-frame number.

$\begin{matrix} {{E_{n}\lbrack j\rbrack} = {{\sum\limits_{i = 1}^{i = \frac{K}{10}}{{{X\;{n\left\lbrack {j,i} \right\rbrack}}}^{2}\mspace{14mu}{for}\mspace{14mu} n}} = {1\mspace{14mu}\ldots\mspace{14mu} R\; W_{target}}}} & {{Equation}\mspace{14mu} 24} \end{matrix}$ A Matrix E defined by Equation 25 is constructed by gathering all the temporal energy vectors from each range bin.

$\begin{matrix} {E = \begin{bmatrix} E_{1} \\ \vdots \\ E_{N} \end{bmatrix}} & {{Equation}\mspace{14mu} 25} \end{matrix}$ The columns of E are the energies of all the ranges along the new wider time bins, and the rows are the energy of a specific bin along time. From each column with indicator k, the five highest elements values may be extracted into w_(k)(r),r=1 . . . 5; together with their row indexes g_(k)(r), as a non-limiting example. The three histograms 432B are created from elements w_(k)(r) as defined by Equations 18A-C.

An accumulated range histogram with elements calculated by Equation 26A: h _(acc)(n)=Σ_(k=1) ^(k=K) w _(k)(r)*I _((g) _(k) _((r)=n)) for n=1 . . . RW _(target)  Equation 26A The indicator function I_((ωεΩ)) is equal to 1 if the condition in the brackets is true. An activity in range over time histogram, with elements calculated by Equation 26B: h _(app)(n)=ΣΣ_(k=1) ^(k=K) I _((g) _(k) _((r)=n)) for n=1 . . . RW_(target)  Equation 26B A normalized energy histogram, with elements calculated by Equation 26C:

$\begin{matrix} {{h_{norm}(n)} = \left\{ \begin{matrix} \begin{matrix} {\frac{h_{acc}(n)}{h_{app}(n)},{{h_{app}(n)} > 0}} \\ {0,{{h_{app}(n)} = 0}} \end{matrix} & {{{for}\mspace{14mu} n} = {1\mspace{14mu}\ldots\mspace{14mu}{RW}_{target}}} \end{matrix} \right.} & {{Equation}\mspace{14mu} 26C} \end{matrix}$

FIG. 4J is a high-level schematic flowchart illustration of an exemplary range energy distribution analysis 434 as part of derivation of energy signature 420B, according to some embodiments of the invention. Range energy distribution analysis module 434 extracts features from the accumulated and normalized energy histograms, which relate to the amount and distribution of motion energy over the range bins. Range energy distribution analysis 434 includes the extraction of the total and maximal (peak) energy over the range bins out of the histogram. In addition, the histogram form factor, defined as the percentage accumulated distribution points, is extracted (for example I20—identifies the range bin point that covers 20% of the accumulated motion energy, I40—identifies the range bin point that covers 40% of the accumulated motion energy, etc.).

FIG. 4K is a high-level schematic flowchart illustration of an exemplary motion over range distribution analysis 436, according to some embodiments of the invention. Motion over range distribution analysis unit 436 extracts features that relate to the distribution of active range bins over time, which is related to the motion's and varying over the down range. Unit 436 extracts the number of times that most of active region has been selected, the total number of active range bins and the mean number of repeated selection of the range bin as an active region.

FIG. 4L is a high-level schematic flowchart illustration of an exemplary motion route energy estimation 438, according to some embodiments of the invention. The motion route energy is defined as the accumulated power along the motion route in the range bins window during the motion period (time) relatively to the overall energy. This feature may be extracted in two major stages: (i) Estimating the motion route by using a dynamic Time Warping (DTW) approach 438A, (ii) accumulating the estimated power along the selected range bin route, and normalizing by the overall power 438B and calculating the motion route peak to average power ratio 438C. The DTW may be performed by selecting the highest range bin power for every sub-frame, as illustrated in FIG. 4M, being a schematic matrix illustration of DTW-based motion route estimation 438A, according to some embodiments. The relative Motion Route Energy (MRE) may be calculated as expressed in Equation 27:

$\begin{matrix} {{MRE} = \frac{\sum\limits_{m = 1}^{Msubframes}{{MP}\lbrack m\rbrack}}{\sum\limits_{m = 1}^{Msubframes}{\sum\limits_{r = 1}^{R}P_{\lbrack{m,r}\rbrack}}}} & {{Equation}\mspace{14mu} 27} \end{matrix}$ Where: P_([m,r])=E_(n=1) ^(N)|x_(m,r[n])|λ² is the power of subframe m, and Range bin r; x_(m,r[n])—is the Slow signal at subframe m and range bin r; and MP[m]=max{P_([m,r])}, rεWindow Range bins, is the Max Power at subframe m.

The motion route Peak to Average Power Ratio (PAR), measured by the ratio between maximal and average power of the motion route, may be calculated as in Equation 28:

$\begin{matrix} {{PAR} = \frac{\max_{m}\left( {{MP}\lbrack m\rbrack} \right)}{\frac{1}{Msubframes}{\sum\limits_{m = 1}^{Msubframes}{{MP}\lbrack m\rbrack}}}} & {{Equation}\mspace{14mu} 28} \end{matrix}$

Human breathing—During the breathing (respiration) the chest wall moves. The average respiratory rate of a healthy adult is usually 12-20 breaths/min at rest (˜0.3 Hz) and 35-45 breaths/min (˜0.75 Hz) during labored breathing. The breathing frequency feature is extracted by estimating the spectrum on the slow-time sampled received echo signal at the target distance (down range) from the system.

The features vector is prepared by quantizing the extracted features with a final number of bits per field and adding the time stamp for the prepared vector. This vector is used as the entry data for the human state classifier (for both training and classifying stages).

FIG. 4N is a schematic illustration of the possibility to separate different types of motions based on the derived parameters, according to some embodiments of the invention.

The two illustrations in FIG. 4N are of the same 3D graphics and are taken from different angles to illustrate the separation of the two types of motion in the 3D parameter space. FIG. 4N clearly illustrates the ability of the analysis described above to separate motions that are categorized, in the non-limiting illustrated case, as fall motions and as regular motions. The results may be used independently to detect falls, or be provided to the classifier for verification and augmentation with additional data and analysis results. Classification of the human state, as described in detail below, may relate to the derived motion characteristics as well as optionally to posture characteristics, respiration characteristics and position characteristics that may be derived from the received echo signals by implementing the disclosed methods, approaches and/or additional analysis of the received echo signals.

Human State Classifier

The Human state classifier is a VQ (Vector Quantization) based classifier. This classifier consists of two main phases: (i) Training phase—it's done offline (supervised training) and online (unsupervised training), where a stream of features vectors reflecting various states are used as a preliminary database for vector quantization and finding the set of code-vectors (centroids) that sufficiently representing the instantaneous human states. The set of the calculated code-vectors are called codebook. Some embodiments of the training sessions are provided in more details hereinafter. (ii) Classifying phase—it's executed during the online operation while an unknown features vector is entered into the classifier and the classifier determines what the most probable state that it represents. The classifier output is the determined states and the set of the measured statistical distances (probabilities), i.e., the probability of State-i given the observation-O (the features vector). The aforementioned probability scheme may be formulated by: P (Si|O). The determined instantaneous state is called “Local Decision”. The VQ states are defined as the set of instantaneous states at various locations at the monitored home environment. Therefore, any state is a 2 dimension results which is mapped on the VQ state matrix. (iii) The State matrix consists of the state (row) and location (Column) followed by a time stamp. Typical elderly home environment consists of the specific locations (Primary zones) and others non-specified locations (Secondary zones). State is defined as the combination of posture/motion at a specific location (e.g., S21 will indicate sleeping at Bedroom).

FIG. 5A is a table 134 illustrating an exemplary states definition in accordance with some embodiments of the present invention. FIG. 5B is a table 135 illustrating an exemplary states matrix in accordance with some embodiments of the present invention.

Cognitive Situation Analysis (CSA)

The CSA's objective is to recognize the abnormal human patterns according to a trained model that contains the possible abnormal cases (e.g., fall). The core of the CSA, in this embodiment, may, in a non-limiting example a Hidden Markov Model (HMM) based pattern recognition. The CSA engine searches for states patterns that are tagged as an emergencies or abnormal patterns. These predefined patterns are stored in a patterns codebook. The output of the CSA is the Global recognized human situation.

FIG. 6 is a table 136 illustrating exemplary abnormal patterns in accordance with some embodiments of the present invention. It can be seen that in the first abnormal case (Critical fall), it appears that the person was sleeping in the leaving room (S25), then was standing (S45) and immediately fell down (S65). He stayed on floor (S15) and start being in stress due to high respiration rate (S75). The CSA may contain additional codebook (irrelevant codebook) to identify irrelevant patterns that might mislead the system decision.

Communication Unit

The communication unit creates the channel between the system and the remote caregiver (family member or operator center). It may be based on either wired (Ethernet) connectivity or wireless (e.g., cellular or WiFi communication or any other communication channel).

The communication unit provides the following functionalities: (i) This unit transmits any required ongoing situation of the monitored person and emergency alerts. (ii) It enables the two way voice/video communication with the monitored person when necessary. Such a communication is activated either automatically whenever the system recognizes an emergency situation or remotely by the caregiver. (iii) It enables the remote system upgrades for both software and updated codebooks (as will be in further detail below). (iv) It enables the communication to the centralized system (cloud) to share common information and for further big data analytics based on multiple deployments of such innovated system.

FIG. 7 is a diagram illustrating cloud-based architecture 700 of the system in accordance with embodiments of the present invention. Raw data history (e.g., states stream) is passed from each local system 100A-100E to the central unit located on a cloud system 710 and performs various data analysis to find correlation of states patterns among the multiple users' data to identify new abnormal patterns that may be reflected just before the recognized abnormal pattern. New patterns code vectors will be included to the CSA codebook and cloud remotely updates the multiple local systems with the new code-book. The data will be used to analyze daily operation of local system 100A-100E.

FIG. 8 is a diagram illustrating a floor plan 800 of an exemplary residential environment (e.g., an apartment) on which the process for the initial training is described herein. The home environment is mapped into the primary zones (the major home places that the monitored person attends most of the time as bedroom 810, restroom 820, living room 830 and the like) and secondary zones (the rest of the barely used environments). The VQ based human state classifier (described above) is trained to know the various primary places at the home. This is done during the system setup while the installer 10A (being the elderly person or another person) stands or walks at each primary place such as bedroom 810, restroom 820, and living room 830 and let the system learns the “fingerprint” of the echo signals extracted features that mostly represents that place. These finger prints are stored in the VQ positions codebook. In addition, the system learns the home external walls boundaries. This is done during the system setup while the installer stands at various places along the external walls and lets the system tune its power and processing again (integration) towards each direction. For example, in bedroom 810, installer 10A may walk along walls in route 840 so that the borders of bedroom 810 are detected by tracking the changes in the RF signal reflections throughout the process of walking. A similar border identification process can be carried out in restroom 820, and living room 830. Finally, the system learns to identify the monitored person 10B. This is done by capturing the fingerprint of the extracted features on several conditions, such as (1) while the person lays at the default bed 812 (where he or she is supposed to be during nighttime) to learn the overall body volume, (2) while the person is standing to learn the stature, and (3) while the person walks to learn the gait. All the captured cases are stored in the VQ unit and are used to weight the pre-trained codebooks and to generate the specific home/person codebooks. According to some embodiments, one or additional persons such as 20 can also be monitored simultaneously. The additional person can be another elderly person with specific fingerprint or it can be a care giver who needs not be monitored for abnormal postures.

FIG. 9 is a diagram illustrating yet another aspect in accordance with some embodiments of the present invention. System 900 is similar to the system described above but it is further enhanced by the ability to interface with at least one wearable medical sensor 910A or 910B coupled to the body of human 10 configured to sense vital signs of human 10, and a home safety sensor 920 configured to sense ambient conditions at said specified area, and wherein data from said at least one sensor are used by said decision function for improving the decision whether an abnormal physical event has occurred to the at least one human in said specified area. The vital signs sensor may sense ECG, heart rate, blood pressure, respiratory system parameters and the like. Home safety sensors may include temperature sensors, smoke detector, open door detectors and the like. Date from all or some of these additional sensors may be used in order to improve the decision making process described above.

FIG. 10 is a high level schematic flowchart of a method 600 according to some embodiments of the invention. Method 600 may be at least partially implemented by at least one computer processor. Certain embodiments comprise computer program products comprising a computer readable storage medium having computer readable program embodied therewith and configured to carry out of the relevant stages of method 600.

Method 600 may comprise transmitting UWB RF signals via transmitting antenna(s) at a specified area (such as an environment including at least one human) and receiving echo signals via receiving antenna(s) (stage 500). At least one of the UWB RF transmitting and receiving antennas comprises a Synthetic Aperture Antenna Array (SAAA) comprising a plurality of linear baseline antenna arrays (“baselines”). Method 600 may comprise configuring receiving antenna(s) and/or transmitting antennas(s) as a Synthetic Aperture Antenna Array (SAAA), such as baseline(s) (stage 510), for example, method 600 may comprise configuring the UWB RF receiving SAAA as a plurality of linear baseline antenna arrays arranged in a rectangle as a non-limiting example, possibly parallel to edges thereof or at acute angles to edges thereof (stage 520), e.g., as illustrated below in a non-limiting manner Method 600 may comprise designing at least one of the linear baseline antenna arrays to comprise two (or more) parallel metal beams flanking the antenna elements of the baseline, to widen the baseline's field of view (stage 525).

Method 600 may further comprise using multiple antennas to implement virtual displacement of the baselines (stage 530), i.e., virtually displacing transmitting or receiving baselines to enhance performance (stage 535). Method 600 may further comprise implementing phase-shifting-based integration (back-projection) to derive parameters relating to the human(s) (stage 540), such as location, movement and/or posture features.

Method 600 may further comprise canceling environmental clutter (stage 605), e.g., by filtering out static non-human related echo signals (stage 606), extracting from the filtered echo signals, a quantified representation of position postures, movements, motions and breathing of at least one human located within the specified area (stage 610), identifying a most probable fit of human current state that represents an actual human instantaneous status (stage 690) and applying a pattern recognition based decision function to the identified states patterns and determine whether an abnormal physical event has occurred to the at least one human in the specified area (stage 693) (see additional details below).

Method 600 may further comprise finding the best match to a codebook which represents the state being a set of human instantaneous condition/situation which is based on vector quantized extracted features (stage 691).

Method 600 may further comprise ensuring, by the filtering out, that no human body is at the environment, using static clutter estimation and static clutter subtraction (stage 607).

Method 600 may further comprise quantizing the known states features vectors and generating the states code-vectors (stage 692A), measuring the distance between unknown tested features vectors and pre-defined known code-vectors (stage 692B) and finding the best fit between unknown tested features vector and pre-determined code-vectors set, using the most probable state and the relative statistical distance to the tested features vector (stage 692C).

Method 600 may further comprise generating the set of abnormal states patterns as a reference codebook, a set of states transition probabilities, and a states-patterns matching function to find and alert on a match between a tested states pattern and the pre-defined abnormal pattern of the codebook (stage 694). Method 600 may further comprise communicating an alert upon determining of an abnormal physical event (stage 695).

Method 600 may further comprise estimating the reflected clutter from a specific voxel to extract the human position and posture features (stage 612A), extracting the human motions and breathing features using Doppler signatures (stage 612B) and creating a quantized vectors of the extracted features (stage 612C).

Method 600 may comprise, with respect to posture (and position) features extraction 612A, processing the received echo signals to derive a spatial distribution of echo sources in the environment using spatial parameters of the transmitting and/or receiving antennas (stage 620), carried out, e.g., by a back-projection algorithm, and estimating a posture of the at least one human by analyzing the spatial distribution with respect to echo intensity (stage 628). Method 600 may comprise canceling environmental clutter by filtering out static non-human related echo signals (see stages 605, 606).

Processing 620 may be carried out with respect to multiple antenna baselines as the transmitting and/or receiving antennas, as explained above. Method 600 may further comprise enhancing echoes received from a lower level in the environment to enhance detection sensitivity to a laying posture of the at least one human (stage 622).

Method 600 may further comprise detecting a position of the at least one human from the spatial distribution (stage 624) and possibly tracking the detected position over time. Canceling environmental clutter 605 may be carried out by spatially characterizing the static non-human related echo signals during an absence of the at least one human from the environment, as detected by the tracking (stage 626).

Posture estimation 628 may comprise analyzing the spatial distribution using curve characteristics of one or more projections of an intensity of the received echo signals onto at respective one or more axes (stage 630), in particular with respect to one or more horizontal axis and a vertical axis. The spatial distribution may be defined using voxels and the posture may be estimated 628 using high-power voxels as defined by a specified power threshold (stage 632).

Method 600 may further comprise classifying the posture characteristics of the at least one human to indicate a state of the at least one human (stage 634), possibly by preparing at least one codebook during a training phase and using the at least one codebook to classify the detected postures (stage 636), as explained above. As explained below, Classification 634 may be carried out by identifying a most probable fit of one of a plurality of predefined states to the motion characteristics (stage 692C), possibly followed by generating an alert once the indicated state is related to at least one specified emergency (stage 695), the alert generation being possibly based on pattern recognition with respect to previously indicated states (stage 694).

Method 600 may further comprise processing the received echo signals to yield a range-bin-based slow signal that is spatio-temporally characterized over a plurality of spatial range bins and a plurality of temporal sub-frames, respectively (stage 640) and deriving from the slow signal a Doppler signature and a range-time energy signature as motion characteristics of the at least one human (stage 650). Method 600 may comprise deriving the Doppler signature by comparing spectral signatures of sub-frames in the slow signals, which are related to identified human-related range bins and sub-frames (stage 642) and deriving the energy signature by evaluating powers of the slow signal at identified human-related range bins and sub-frames (stage 644). Method 600 may comprise deriving the Doppler signature and/or the energy signature with respect to different body parts of the at least one human (stage 646).

Deriving 650 may further comprise deriving location data as movement characteristics of the at least one human (stage 652). Deriving of the location data 652 may comprise detecting displacements of the at least one human using back-projection (stage 654), using the received echo signals to derive, by back projection, 2D location data and 3D posture data about the at least one human (stage 655), and/or identifying human-related range bins and sub-frames in the slow signals (stage 656). Deriving of the location data 652 may be carried out using a spatio-temporal histogram of the range-time energy signature and by identifying on the histogram range changes of at least body parts (e.g., limbs) of the at least one human (stage 658). The motion characteristics and/or movement characteristics may comprise gait parameters.

Method 600 may further comprise handing over detecting 654 among a plurality of interferometry units according to detected displacements (stage 660), i.e., using different interferometry units for detection 654 according to displacement parameters, such as coverage region, signal intensity etc., as explained below. Method 600 may be carried out by a plurality of UWB RF receiving SAAAs positioned at a plurality of positions, and may further comprise integrating the received echo signals from the UWB RF receiving SAAAs (stage 662).

Method 600 may comprise classifying the position and/or posture and/or motion and/or movement and/or respiration characteristics of the at least one human to indicate a state of the at least one human (stage 688). Classification 688, e.g., by identifying the most probable fit 690, may be carried out by identifying a most probable fit of one of a plurality of predefined states to the motion characteristics.

Communicating the alert 695 may be carried out by generating the alert once the indicated state is related to at least one specified emergency. The alert generation may be based on pattern recognition with respect to previously indicated states.

Aspects of the present invention are described above with reference to flowchart illustrations and/or portion diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each portion of the flowchart illustrations and/or portion diagrams, and combinations of portions in the flowchart illustrations and/or portion diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or portion diagram portion or portions.

These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or portion diagram portion or portions.

The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or portion diagram portion or portions.

The aforementioned flowchart and diagrams illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each portion in the flowchart or portion diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the portion may occur out of the order noted in the figures. For example, two portions shown in succession may, in fact, be executed substantially concurrently, or the portions may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each portion of the portion diagrams and/or flowchart illustration, and combinations of portions in the portion diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.

In the above description, an embodiment is an example or implementation of the invention. The various appearances of “one embodiment”, “an embodiment”, “certain embodiments” or “some embodiments” do not necessarily all refer to the same embodiments. Although various features of the invention may be described in the context of a single embodiment, the features may also be provided separately or in any suitable combination. Conversely, although the invention may be described herein in the context of separate embodiments for clarity, the invention may also be implemented in a single embodiment. Certain embodiments of the invention may include features from different embodiments disclosed above, and certain embodiments may incorporate elements from other embodiments disclosed above. The disclosure of elements of the invention in the context of a specific embodiment is not to be taken as limiting their use in the specific embodiment alone. Furthermore, it is to be understood that the invention can be carried out or practiced in various ways and that the invention can be implemented in certain embodiments other than the ones outlined in the description above.

The invention is not limited to those diagrams or to the corresponding descriptions. For example, flow need not move through each illustrated box or state, or in exactly the same order as illustrated and described. Meanings of technical and scientific terms used herein are to be commonly understood as by one of ordinary skill in the art to which the invention belongs, unless otherwise defined. While the invention has been described with respect to a limited number of embodiments, these should not be construed as limitations on the scope of the invention, but rather as exemplifications of some of the preferred embodiments. Other possible variations, modifications, and applications are also within the scope of the invention. Accordingly, the scope of the invention should not be limited by what has thus far been described, but by the appended claims and their legal equivalents. 

The invention claimed is:
 1. A method comprising: transmitting, via at least one transmitting antenna, ultra-wide band (UWB) radio frequency (RF) signals at an environment including at least one human, and receiving, via at least one receiving antenna, echo signals from the environment, processing the received echo signals to derive a spatial distribution of echo sources in the environment using spatial parameters of the at least one transmitting and/or receiving antennas, and estimating a posture of the at least one human by analyzing the spatial distribution with respect to echo intensity, wherein the posture estimation comprises analyzing the spatial distribution using curve characteristics of at least one projection of an intensity of the received echo signals onto at least one respective axis, wherein the spatial distribution is defined using voxels and wherein the posture is estimated using high-power voxels as defined by a specified power threshold, and wherein the posture estimation further comprises characterizing postures of the at least one human with respect to at least a horizontal and a vertical axes.
 2. The method of claim 1, wherein the processing of the received echo signals is carried out by a back-projection algorithm.
 3. The method of claim 1, further comprising canceling environmental clutter by filtering out static non-human related echo signals.
 4. The method of claim 1, further comprising enhancing echoes received from a lower level in the environment to enhance detection sensitivity to a laying posture of the at least one human.
 5. The method of claim 1, further comprising detecting a position of the at least one human from the spatial distribution.
 6. The method of claim 5, further comprising tracking the detected position over time.
 7. The method of claim 6, further comprising canceling environmental clutter by filtering out static non-human related echo signals which are spatially characterized during an absence of the at least one human from the environment, as detected by the tracking.
 8. The method of claim 1, wherein the processing is carried out with respect to a plurality of antenna baselines as the at least one transmitting and/or receiving antennas.
 9. The method of claim 1, further comprising classifying the posture characteristics of the at least one human to indicate a state of the at least one human.
 10. The method of claim 9, wherein the classifying is carried out by preparing at least one codebook during a training phase and using the at least one codebook to classify the detected postures.
 11. The method of claim 9, wherein the classifying is carried out by identifying a most probable fit of one of a plurality of predefined states to the posture characteristics.
 12. The method of claim 9, further comprising generating an alert once the indicated state is related to at least one specified emergency.
 13. The method of claim 12, wherein the alert generation is based on pattern recognition with respect to previously indicated states.
 14. A non-wearable monitoring system comprising: an ultra-wide band (UWB) radio frequency (RF) interferometer configured to transmit UWB RF signals at, and to receive echo signals from, an environment including at least one human, a processing unit configured to process the received echo signals to derive a spatial distribution of echo sources in the environment using spatial parameters of the at least one transmitting and/or receiving antennas, and a feature extractor configured to estimate a posture of the at least one human by analyzing the spatial distribution with respect to echo intensity, wherein the posture estimation comprises analyzing the spatial distribution using curve characteristics of at least one projection of an intensity of the received echo signals onto at least one respective axis, wherein the spatial distribution is defined using voxels and wherein the posture is estimated using high-power voxels as defined by a specified power threshold, and wherein the posture estimation further comprises characterizing postures of the at least one human with respect to at least a horizontal and a vertical axes.
 15. The non-wearable monitoring system of claim 14, wherein the processing unit is further configured to cancel environmental clutter by filtering out static non-human related echo signals, process the received echo signals by a back-projection algorithm, and analyze the spatial distribution using curve characteristics of at least two projections of an intensity of the received echo signals onto a vertical axis and at least one horizontal axis.
 16. The non-wearable monitoring system of claim 14, further comprising: a human state classifier configured to classify the posture of the at least one human to indicate a state of the at least one human, and an abnormality situation pattern recognition module configured to generate an alert once the indicated state is related to at least one specified emergency.
 17. The non-wearable monitoring system of claim 16, wherein the classifying is carried out by preparing at least one codebook during a training phase and using the at least one codebook to classify the detected postures. 